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1.  Introduction 

Geometric  programming  Is  a  branch  of  nonlinear  programming  that 
considers  certain  primal  nonllnearly  constrained  convex  programs  and  their 
associated  dual  linearly  constrained  convex  programs.  The  development  of 
geometric  programming  Is  largely  the  work  of  Duffln,  Peterson,  and  Zener  [5]. 
The  strong  duality  theory  [4,5]  of  geometric  programming  states  that,  under 
mild  conditions,  to  solve  a  primal  one  can  solve  Its  dual  and  then  solve  a 
single  system  of  linear  equations,  known  as  the  invariance  conditions.  Since 
many  practical  problems,  especially  in  optimal  engineering  design,  [16,17], 
are  naturally  cast  as  primal  problems,  a  major  computational  advantage  of 
this  duality  Is  that,  when  the  dual  Is  solved  with  a  feasible  direction 
method,  the  linear  equality  constraints  restrict  the  choice  of  feasible  direc¬ 
tions.  For  such  methods,  the  computational  effort  required  to  solve  the  dual 
Increases  with  the  dimension,  called  the  degree  of  dif ficul ty.  of  the  dual 
feasible  region. 

To  solve  a  primal,  rather  than  solving  its  dual  directly,  we  propose 


solving  the  duals  of  a  sequence  of  approximating  primals.  Each  approximating 
primal  is  obtained  from  the  primal  by  using  the  current  primal  solution 
estimate  to  condense  [4]  certain  terms  In  the  constraints  and  the  objective 


function,  as  specified  by  a  canonical  submatrix  of  the  matrix  of  exponents. 


The  dual  of  each  approximating  primal  has  fewer  degrees  of  difficulty  than 
the  dual  of  the  given  primal.  Therefore,  our  scheme  is  to  solve  a  sequence 
of  problems,  each  pos&.d  in  the  same  low  dimensional  space,  rather  than  solve 
one  problem  in  a  higher  dimensional  space. 

In  this  paper,  all  vectors  are  column  vectors,  unless  otherwise 
specified,  and  x'  and  A'  denote  the  transpose  of  the  vector  x  and  the 
matrix  A,  respectively.  If  x,  y  i  Rm,  by  x  ■  y,  x  J>  y,  and  x  >  y,  we 
mean  x^  ■  y^,  x^  >_  y^,  and  x^  >  y^,  respectively,  for  j  -  l,2,...,m.  The 
end  of  a  proof  is  denoted  by  «£. 


2.  Primal  and  Dual  Geometric  Programs 

Geometric  programming  studies  pairs  (P  ,  D)  of  optimization  problems. 
The  primal  P  is  defined  on  Rm: 


where 


minimize 
subject  to 


h0(x> 

hk(jc)  £  1, 
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(k)  are  successive  blocks  of  the  integers  1,  2,  ...  ,  n: 

where  «q  ■  1#  -  nQ  +  1,  ^  ^  +  1 .  mp  -  n  j+1, 

is  positive  for  1  ■  1,2,...,  n,  and  a^  ■  (a^,  ai2,  ... 
row  vector,  with  a^  unconstrained  in  sign  for  each  i 
means  a^x^.  Each  h^  is  called  a  posynomlal,  and 


[k]  -  {n^,  mj+1,. . . 

•  •  •  %  n  *  n »  c. 
p  i 

,  aim)  is  a 

and  j .  Here  a^x 

h,  is  said  to  have 
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I  [k] |  tim,  where  |s|  denotes  the  nuaber  of  elements  in  set  S.  The 
priaal  Is  said  to  have  n  terns,  since  n  -  ££  l[k]|. 

The  duel  D  Is  defined  on  Rn: 


where 


and 


maximize 


subject  to 


6  €  D 


Km  l  6i  for  k  ■  p, 

*  i€[k] 

D  -  {6|«'A  -  0,  l  6  -  1,  and  6  >  0) 

1U0]  1 


is  the  n  x  m  matrix  of  exponents.  Here  6’ A  denotes  the  m-dimensional  row 

vector  whose  J—  coordinate  Is  T?  ,  fi,a,., 

Li"l  i  ij 

Clearly,  P  is  a  convex  program,  as  is  D  when  v(6)  is  replaced 
with  -log  v(6)  [51.  From  [4,5]  we  learn  that  if  x  is  feasible  for  P 
and  6  la  feasible  for  its  dual  D,  then  hQ(x)  >  v(6).  Under  the  same 
conditions,  equality  holds  if  and  only  if  the  ^invariancc^ 
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•re  satisfied.  Taking  logarithms  of  each  equation  generates  a  system  of 
equations  linear  In  x.  Since  6^  >  0  for  some  1  In  [k]  If  and  only 
If  5^  >  0  for  each  1  in  Ik],  the  invariance  conditions  are  a  set  of 
I [OJ |  +  f R  l[k]|  nontrivial  equations  in  m  unknowns,  where 
K+  -  {k|*k  >  0,  k  -  1,2 . p}. 

A  program  is  consistent  if  its  feasible  region  is  nonempty.  We  say 
P  is  super consistent  if  for  some  x®  we  have  hk(x®)  <  1»  k  ■  1,2,...,  p. 

If  P  is  consistent  and  has  a  positive  infimum  u>,  then  its  dual  is  consistent 
and  has  a  finite  supremum  £,  and  w  ■  £.  If  also  P  is  super consistent , 
then  £  is  attained  at  a  dual  feasible  point  [4,5]. 

We  are  interested  in  uniqueness  of  solution  in  geometric  programming. 
Recall  that  the  posynomlal  h  is  strictly  convex  if  its  exponent  matrix  has 
full  rank  (see  p.  31,  Zener  (17]}.  We  write 


where  Ajkj  contains  those  rows  of  A  corresponding  to  h^, 
By  a  Kuhn-Tucker  pair  (z,u)  for  P  we  mean  that  z 
u  is  a  vector  of  Lagrange  multipliers  for  P. 


for  k  -  0,1,. . . , 
solves  P  and 
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Proposition  1.  For  aos*  Kuhn-Tucker  pair  (*,u)  of  P,  1st  K  bs  that  aub- 
aatrix  of  A  obtainad  by  dalating  A^j  for  sach  k  satisfying  •  0. 

If  rank  E  ■  a,  than  s  is  tha  uniqua  solution  of  P. 

Proof.  Sines  (a,u)  is  a  Kuhn-Tucker  pair  for  P,  than  s  minimises 


i 


ci# 


•ix 


Sines  u  >  0  and  rank  E  ■  a,  it  follows  that  s  is  tha  uniqua  minimum  of 
this  function.  By  Corollary  28.1.1,  Rockafellar  [12],  s  Is  tha  uniqua  solu¬ 
tion  of  P.  • 

Proposition  2.  Suppose  P  is  super cons latent  and  has  the  unique  Kuhn-Tucker 
pair  (»,u).  Let  E  be  as  in  Proposition  1.  If  rank  K  -  m,  then  the  dual 
D  has  a  uniqua  solution. 

Proof.  Each  Lagrange  multiplier  u^  is  related  to  the  variable 
*k  ■  (  {kj  6i  of  th*  dual  by  \  ■  u^/hgfs),  k  ■  1.2.- P  (»«•  p.  H*. 

Duff in,  Peterson,  and  Zener  [5]).  Since  each  u^  is  uniquely  determined, 
than  so  is  each  X^.  By  tha  strong  duality  theory,  the  dual  has  at  least 
one  solution.  Each  dual  solution  is  related  to  s  by  the  invariance  con¬ 
ditions.  Since  ■  is  tha  unique  solution  of  P  and  rank  E  -  n,  the 
invariance  conditions  gsnarate  the  unique  solution  of  the  dual.  ^ 

Proposition  1  implies  that  tha  ability  to  determine  in  advance  which 
primal  constraints  must  be  active  at  any  primal  solution  can  prove  a  priori 
that  the  primal  has  a  unique  solution.  The  monotonicity  analysis  of  Wilde  (16) 


5 


and  Papalambros  (11]  la  vary  useful  for  determining  conatralnt  activity. 
Proposition  1  la  Important  because  proving  global  convergence  of  our  method 
requlrea  a  priori  knowledge  that  the  given  primal  and  each  approximating 
primal  have  unique  solutions.  Proposition  2  asserts  that,  under  reasonable 
conditions,  the  dual  has  a  unique  solution  If  the  primal  does.  This  theorem 
Is  useful  for  our  method,  which  solves  only  dual  problems. 


3*  Canonical  Prlmals  and  Degrees  of  Difficulty 

Our  method  can  be  applied  to  a  geometric  primal  P  only  If  P 

Is  canonical,  that  Is,  If  Its  solution  set  is  nonempty  and  bounded.  It  Is 

reasonable  to  expect  a  well-formulated  problem  to  be  canonical.  If  P  is 

consistent  but  not  canonical,  then  the  solution  set  Is  either  empty  or  in- 

bounded.  In  the  latter  case,  by  deleting  terms  or  variables,  we  can  reduce 
P  to  a  canonical  program  (1,5], 

Proposition  3.  Suppose  P  is  consistent.  Then  the  following  are  equivalent 

1)  P  is  canonical 

2)  Ad  £  0  implies  d  ■  0 

3)  rank  A  ■  m  and  d'A  ■  0,  5  >0  has  a  solution. 

Proof.  1)  <-->  2).  See  Abrama  (1).  Notice  that  "Ad  <  0  implies  d  -  0" 

says  that  the  objective  function  and  constraints  have  no  cosason  direction  of 
receaaion.  2)  <■■>  3).  By  Stlenke'a  Theorem  [9],  Ad  0,  Ad  +  0  has  no  so lu 
tion  If  and  only  If  d'A  ■  0,  5  >  0  has  a  solution.  Moreover,  Ad  -  0  has 
no  nonsero  solution  if  and  only  If  rank  A  *  m.  • 


We  write 


I£  P  is  consistent,  its  feasible  region  is  bounded  if  and  only  if 
A^  Qjd  <_  0  implies  d  -  0(1]. 

Notice  that,  for  a  consistent  primal,  the  cost  coefficients  {c^} 
play  no  role  in  determining  whether  or  not  P  is  canonical.  This  is  a 
wonderful  feature,  since  cost  coefficients  are  usually  not  known  exactly, 
while  the  exponent  matrix  is  usually  determined  from  the  laws  of  science 
and  therefore  is  often  known  exactly. 

From  Proposition  3  and  the  strong  duality  theory,  it  is  easy  to 
prove  that,  if  P  is  canonical,  its  dual  is  consistent  and  has  n-m-1 
degrees  of  difficulty.  If,  for  each  dual  feasible  point,  one  or  more  com¬ 
ponents  are  always  zero,  then,  as  discussed  in  Abrams  [1],  these  components 
and  the  corresponding  primal  terms  can  be  dropped.  In  this  case,  the  degree 
of  difficulty  is  strictly  less  than  n-m-1. 

Notice  that,  for  a  fixed  number  of  primal  variables,  the  degree  of 
difficulty  increases  with  the  number  of  primal  terms.  Although  geometric 
programming  succeeds  with  duals  of  small  degree  of  difficulty,  turning  to 
the  dual  can  In  many  cases  produce  unreasonably  large  problems.  Our  method 
modifies  the  primal  to  reduce  the  number  of  terms,  thus  lowering  the  degree 
of  difficulty  of  its  dual. 
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Condensation  and  Canonical  Submat rices 


The  key  to  reducing  the  number  of  primal  terms  is  a  technique  called 

condensat ion .  discovered  by  Duffln  [4].  Wilde  [16]  has  used  condensation  to 

simplify  problems  in  optimal  engineering  design. 

Condensation  maps  a  posynomial  into  a  one-term  posynomlal.  Consider 
n  *ix 

h(x)  -  e^e  •  For  every  x,  each  term  of  h(x)  is  positive.  Given 

the  point  y,  form  the  vector  e  •  (e^,  e^,  ...  ,  e  ) ,  where 


n. 


Since  e.  >0  for  each  i  and  j*  t  »  1,  the  vector  e  is  called  a 
i  *'i"l  i 

vector  of  primal  weights  for  h  at  v.  We  define  the  total  condensation 
of  h  about  y  to  be  the  function  h(’,y)  given  by  h(x,y)  ■  ceax,  where 


"  /  Ci\€i  n 

c  .  n  I  -M  and  a  -  £  e  a  . 

i-1  i  i-1 


Notice  that  h(*,y)  is  a  one-term  posynomial.  It  is  well-known  [2,4] 
that  h(y)  -  h(y,y),  Vh(y)  -  ?jh(y,y)  (where  V  denotes  the  gradient  with 
respect  to  the  first  argument),  and  h(x)  h(x,y)  for  every  x. 

Me  now  present  a  scheme  for  condensing  a  canonical  primal  to  reduce 
the  degree  of  difficulty.  Section  3  proved  that  a  consistent  primal  F 
is  canonical  if  and  only  if  its  exponent  matrix  A  satisfies  Ad  0 

implies  d  •  0.  Suppose  now  that  P  is  consistent  and  canonical.  One  can 
often  delete  rows  of  A  so  that  the  resulting  matrix,  denoted  by  AC, 
satlsifes  the  following  conditions: 


1)  ACd  <  0  implies  d  -  0 

2)  at  luit  two  row*  of  Ajk j  are  deleted  for  iom  k  in  {0,1,...,  p). 

If  AC  satisfies  1)  and  2),  we  call  Ac  a  canonical  submatrix  of  A.  (This 
definition  diffara  somewhat  froa  that  in  Roaanberg  [14].) 

The  concept  of  a  canonical  aubaatrlx  haa  a  simple  geometric  inter¬ 
pretation.  Consider  a  canonical  unconstrained  P  whose  objective  function 
has  two  variables  and  five  terms.  By  Proposition  3,  the  matrix  A  must  have 
full  rank,  and  d'A  -  0,  >5  >0  auat  have  a  solution.  Such  a  situation  Is 
shown  in  Figure  1.  Notice  that  a^,  a^,  and  a^  form  a  canonical  submatrix. 

In  contrast,  a^,  a5,  and  a^  do  not. 

A  canonical  submatrix  A  tells  which  terms  of  P  can  be  condensed 

without  P  becoming  noncanonical.  For  each  k  ■  0,1,...,  p,  collect  each 

(* 

term  whose  exponent  vector  a^  was  deleted  from  A  to  form  A  .  Add  these  terms 

T  T 

together  to  build  a  posynomial,  which  we  call  h^.  Equivalently,  h^  is 

obtained  by  deleting  from  h^  each  term  whose  exponent  vector  appears  In  A*" . 

m  T 

Given  the  point  v  in  R  ,  totally  condense  h^  about  y  to  form 

-T 

h^(*,y).  For  each  k  -  0,  1,  ...  ,  p,  let 

hk(*..v)  ■  (h^U)  -  h*(x)>  +  h£(x,y)  . 

That  is,  h^(’,y)  is  obtained  from  h^  by  totally  condensing  about  y 
all  terms  with  exponent  vectors  not  appearing  in  A  .  Finally,  define  the 
condensed  primal  P(y) : 
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minimize  hg(x,y) 
x 

subject  Co  h^(x,y)  <_  1,  k  -  1,2 . .  . 

IC  follows  Immediately  from  the  above  remarks  on  condensation  that, 
for  every  y,  P(y)  has  a  larger  feasible  region  and  a  (polntwise)  smaller 
objective  function  than  P.  Moreover,  P(y)  is  canonical  for  every  y, 
since  the  exponent  matrix  of  P( y)  has  (after  possibly  reordering  the  rows) 
the  form 

(“) 

for  some  matrix  W(y).  Since  Acd  .<  0  Implies  d  -  0,  then  P(y)  is 
canonical,  by  Proposition  3.  Moreover,  condition  2)  of  the  definition  of 
a  canonical  submatrix  implies  that  P(y)  has  fewer  terms  than  P,  so  that 
its  dual  has  fewer  degrees  of  difficulty.  An  example  of  how  to  generate  a 
condensed  primal  is  presented  in  Section  7,  along  with  numerical  results. 


5.  Obtaining  a  Canonical  Submatrix 

The  following  is  a  procedure  that  searches  for  a  canonical  submatrix. 
Recall  that  a  matrix  Ac  obtained  by  deleting  rows  of  A  is  a  canonical 
submatrix  if 

1)  Acd  <_  0  implies  d  ■  0  and 

2)  at  least  two  rows  of  A^j  are  deleted  for  some  k  in  {0,1,..., p). 


K 
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The  procedure  uses  linear  programming  to  search  for  a  submatrix  of  A  satisfying 

1) *  If  the  aubmatrix  also  satisfies  2),  then  it  is  a  canonical  submatrix. 

We  observed  in  Section  3  that  P  ia  canonical  if  d'A  -  0,  d  >  0 
has  a  solution  and  rank  A  -  m. 

Consider  the  Phase  I  [3]  linear  program: 


minimise 

e’w 

m 

+  y 

subject  to 

A’d 

+  w  -  0 

e'd 

n 

+  y  -  l 

(d  . 

w,  y)  >  0 

r 

where  e  t  R  is  a  column  vector  of  ones.  Notice  that  l.P  has  nH-1  con- 
r 

8tralnts.  Clearly  LP  is  consistent,  Let  y  be  the  optimal  objective  function 
value  of  LP.  Then  y  >  0. 

Proposition  A.  If  P  is  canonical,  then  y  ■  0. 

Proof .  If  P  is  canonical,  then  d'A  -  0,  d  >  0  has  a  solution.  Therefore 

d'A  -  0,  e'd  ■  l,  d  >  0  has  a  solution,  so  that  every  solution  of  LP  has 
n  — 

w  -  0  and  y  ■  0,  Hence  y  «■  0. 

By  Proposition  A,  if  y  >  0  then  P  is  not  canonical;  hence  no 
canonical  submatrix  exists.  In  this  case,  reduction  may  be  desirable  (see 
Abrams  (11).  Notice  that 
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provide*  a  counterexample  to  the  converge  of  Proposition  4:  P  is  not 
canonical,  since  rank  A  ■  1,  yet  6  -  (1/2,  1/2)',  w  -  (0,0)',  y  -  0 
solves  LP  and  y  •  0. 

Suppose  the  simplex  method  (3}  used  to  solve  LP  gives  y  ■  0.  if 
m+1  components  of  6  are  positive  in  the  optimal  solution  of  LP,  then  the 
corresponding  rows  of  A  constitute  a  canonical  submatrix  if  condition  2) 
is  satisfied.  Unfortunately,  degenerate  aolutions  of  LP  are  poaslble,  even 
when  P  is  canonical.  For  example,  in  Figure  1,  where  *2  ■  -a^ ,  the 
solution  $2  ■  6^  •  1/2;  6  ■  0,  i  /  2,  5  is  a  degenerate  solution. 

Termination  in  the  degenerate  case  does  not  autoeutically  imply  failure. 

For  it  may  be  possible  to  find  alternative  nondegenerate  solutions  of  LP  by, 
for  example,  bringing  into  the  basis  a  column  with  sero  reduced  cost.  It 
is  possible  to  systematically  determine  all  basic  solutions  of  LP  (see  [6) 
and  references  therein)  but  such  a  procedure  is  generally  unnecessary. 


6.  The  Algorithm  and  a  Stopping  Criterion 

This  eectlon  presents  the  condensation  algorithm  and  discusses  some 
of  the  details  of  Implementation.  We  also  describe  how  the  algorithm  auto¬ 
matically  generates  a  sequence  of  lower  bounds,  thus  suggesting  a  stopping 
criterion  that  may  substantially  reduce  computation. 

To  execute  the  algorithm  for  solving  P,  we  require  that  P  be 
super consistent  and  that  a  canonical  submatrix  Ac  be  available.  If 
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P  satisfies  only  these  two  conditions  then  the  algorltha  can  be  executed. 
However,  to  prove  the  algorltha  globally  convergent,  we  aust  impose  additional 
assumptions.  Failure  to  establish  that  aone  given  P  satisfies  these  extra 
assumptions  should  not  discourage  use  of  the  algorithm.  To  see  how  the 
algorltha  might  still  be  of  use,  suppose  that  we  take  an  arbitrary  point  x 
as  the  first  solution  estimate.  Since  a  canonical  submatrix  is  available, 
then  P(x)  is  canonical  and  has  a  dual  easier  to  solve  than  the  dual  of  P. 

It  might  be  that  a  solution  of  P(x),  obtained  after  relatively  small  effort 
by  solving  the  dual  of  P(x)  •  also  qualifies  as  an  acceptable  solution  of 
P,  so  that  no  further  computation  is  necessary.  The  Important  word  here 
is  ''acceptable."  "He  that  knoweth  when  he  hath  enough  is  no  fool."  (John 
Heyvood  (1546)  as  quoted  in  Wilde  [16].)  For  example,  it  is  foolish  to 
compute  an  extremely  accurate  solution  to  a  problem  formulated  with  Inaccurate 
data. 

We  now  discuss  the  extra  assumptions  needed  to  insure  global  con¬ 
vergence.  First  is  that  P  have  a  bounded  feasible  region.  As  mentioned 
in  Section  3,  this  is  true  If  and  only  if 

0]**  —  ®  implies  d  »  0  , 

where  A ^  ^  is  that  submatrix  of  A  obtained  by  deleting  the  exponent 
vector  of  each  objective  function  term.  If  our  problem  fails  to  satisfy 
this  condition,  we  could  impose  additional  constraints  bounding  the  feasible 
region  so  that  no  solution  of  P  violates  the  additional  constraints.  Also, 
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P  and  mh  P(x)  (that  la,  for  any  x)  must  ba  known  in  advance  to  have 
unique  eolutiona.  Sufficient  conditlona  for  a  primal  to  have  a  unique  solution 
have  been  derived  in  Section  2.  Aa  mentioned  there,  verifying  that  a  primal 
has  a  unique  solution  can  often  be  accomplished  by  determining  constraint 
activity. 

A  solution  of  £(x)  deflnea  a  descent  direction  of  an  exact  penalty 
function.  These  interesting  functions  have  been  well  studied  (see  [8],  [13], 
and  references  therein).  Here  we  simply  define  0^,  the  exact  penalty  function 
of  the  primal  P,  by 

6  (x)  -  hQ(x)  +  p  £  max(0,  h^x)  -  1)  , 
p  k*l 

where  p  is  a  fixed  positive  number. 

Notice  that  8  is  convex  and  nondifferentlable.  Our  method  for 
P 

solving  P  requires  a  line  search  on  6^  at  each  iteration.  Fibonacci  or 
golden  section  [IS]  or  super linearly  convergent  [10]  line  search  routines  can 
be  utilized. 

Our  algorithm  is  the  following.  Let  the  positive  numbers  p  and  6 
be  given.  Choose  any  Xq  in  R®. 

Algorithm  CPA.  For  i  ■  0,  1,  2,  ... 

1)  solve  the  condensed  primal  P(x^)  to  obtain  a  solution  z^;  let 


2)  find  an  such  that 


15 


a.  <  arg  min  0  (x.  +  ad.)  ; 
1  0<a<$  p  1  1 


i«  ,l+1 .  *  .1d1 


3)  STOP  if  "  Xj?  otherwise,  go  to  1)  with  i  replaced  by  i+1. 


We  now  present  the  global  convergence  theorem. 


Theorem  1.  Suppose  that  P  is  superconsistent  and  has  a  bounded  feasible 
region.  Suppose  also  that  P  and  P(x)  have  unique  solutions  for  any  x. 

Then  there  is  a  positive  number  pQ  such  that,  whenever  p  2  Pq»  algorithm  GPA 
either  stops  at  the  unique  solution  of  P  or  generates  a  sequence  (x^l 
converging  to  it. 

Proof.  Since  condensation  preserves  function  value  and  first  partial 
derivatives  at  the  point  of  condensation,  and  since  the  objective  function 
hg  is  everywhere  positive,  the  result  follows  immediately  from  this  author's 
convergence  theory  [13  J.  <*& 


Notice  that  the  success  of  algorithm  GPA  depends  on  choosing  a 
sufficiently  large  value  for  p.  We  offer  no  a  priori  advice  on  how  to 
choose  p,  since  the  threshold  value  pg  Is  actually  a  function  of  the 
solution  set  of  P,  of  course  unknown. 

An  important  feature  of  algorithm  GPA  is  that  it  automatically  pro¬ 
duces  a  sequence  of  lower  bounds  on  8^,  the  exact  penalty  function  associated 
with  P.  We  now  characterize  this  sequence  and  explain  how  to  use  it  as  a 
very  effective  stopping  criterion. 
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Ac  each  itaration  1  of  GPAt  va  aolve  ?(x^)  (via  its  dual)  Co  obCain 
a  solution  z^.  The  optimal  objaedva  function  value  of  £(x^)  is  Kq^.x^). 

Since  each  function  defining  P(x^)  undarastiaatea  Che  corresponding  function 
defining  P,  it  must  be  Chat  hg(z).>  hg(z^,x^),  where  hg(z)  is  the  optimal 
objective  function  value  of  P.  Alao,  alnce  descreases  monotonically 

to  8p(z)  *  hp(z),  then  8p(x^)_>  hg(z).  Combining  the  above,  we  have 
0p(xi)_>ho(*)_?  hgtz^.x^).  This  shows  that  the  sequences  {©^(x^) )  and  (iig(z  ,xA) 
bound  hg (z)  from  above  and  below,  respectively.  Since  z  solves  ?(z),  the 
bounds  are  tight,  that  is,  8p(*)  “  hg(z)  -  hg(z,z). 

We  cannot  say  that  the  sequence  (hg(z^,x^) )  is  monotone  Increasing. 

However,  non-monotonlc  behavior  occurred  in  only  one  test  problem,  suggesting 
that  non-monotonic  behavior  is  the  exception  rather  than  the  rule. 

The  lower  bounds  provide  an  excellent  stopping  criterion:  accept 
x1  as  a  solution  of  P  if  *  meXg  <  j  <  ^  hg(z^,Xj)  —  el*  for  80mB 

positive  number  e^.  This  test  la  especially  useful  when  the  objective 
function  is  not  known  to  great  accuracy,  aa  in  optimal  design  problems. 

This  criterion  might  be  used  in  conjunction  with  the  requirement  that 

not  violate  any  constraint  by  more  than  *or  some  positive  number  e2* 

Notice  that  these  lower  bounds  are  not  generated  when  the  primal  is  solved 
by  the  well-known  technique  of  solving  quadratic  subproblems  in  an  active 
set  strategy  [7]  ;  indeed,  the  generation  of  these  lower  bounds  is  a  very 
special  feature  of  our  method. 


Che  data  or  parameters  and  conducting  sensitivity  analysis  is  easy. 

We  solve  each  condensed  dual  by  obtaining  a  feasible  point  and 
applying  a  projected  Newton-like  method  (see  Chapter  3,  Gill  and  Murray  [7]). 
This  feasible  direction  method  requires  a  line  search  on  the  condensed  dual 
objective  function  at  each  iteration.  We  employ  the  golden  section  method  [15] 
to  solve  the  line  search.  The  original  interval  of  uncertainty  is  chosen  to 
be  mln(AtB),  where 

A  -  the  minimum  step  along  the  condensed  dual  search  direction 
that  causes  some  component  of  the  solution  estimate  to  hit 
aero 

and 

B  ■  100  times  the  length  of  the  condensed  dual  search  direction. 

Each  line  search  is  stopped  when  the  values  of  the  function  at  either  end 

—8 

of  the  current  Interval  of  uncertainty  differ  by  no  more  than  10  .  Each 

condensed  dual  optimisation  is  stopped  when  successive  line  searches  yield  con- 

-8 

densed  dual  objective  function  value  estimates  that  differ  by  no  more  than  10 

Our  experience  indicates  that  these  tolerances  cannot  be  greatly 

-3 

increased,  since  chenglng  them  to  10  led  to  jamming  at  x^.  That  is, 
we  obtained  x^  ■  Xj  ■  •  ••• 
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We  also  usad  tha  golden  sactlon  aachod  for  each  exact  penalty 

function  line  search.  We  choose  8  ■  10;  this  valve  is  more  than  adequate. 

Bach  exact  penalty  function  line  search  la  stopped  when  the  values  of 

the  function  at  either  end  of  the  current  Interval  of  uncertainty  differ 

by  no  mors  than  10  •  Our  axparlanca  indicate*  that  this  tolaranca  may  be 

.1 

increased  to  10  with  no  significant  detriaental  effect.  However,  we  chose 
to  use  the  forner  value. 

For  our  test  problems  we  chose  p  -  400.  This  choice  was  found  to  be 

adequate.  According  to  the  theory  in  [13],  nothing  la  gained  by  increasing  P 

beyond  the  threshold  value  Pq.  Indeed,  increasing  p  to  1000  in  one  problem 
produced  a  sequence  of  solution  estimates  identical  to  the  fifth  decimal  place. 

Representative  computer  results  are  given  below.  By  6^  we  mean  the 

exact  penalty  function  corresponding  to  the  given  primal  P.  All  entries  in 

the  table  are  rounded  to  five  significant  figures.  At  each  iteration  1,  we 
supply 

a.  solves  P(x^) 

b.  0  (x... ),  where  x..,  solves  the  line  search  on  8 

p  l+l  i+i  p 

c.  the  objective  function  and  constraints  of  P  at  xi+^ 

d.  the  lower  bound  h^ft^x^. 
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Problea 


«ln lali*  a 


Vx2 


subject  to  Yf  •  ♦  YT 


1  *X1  2  Xlf*X2  3  X2~X1  4  2xj+Xj 


*  11  * 


+  n* 


<  i 


j  -x^  ^  xl~x2  7  *l~^x2  8  2x,-x. 


27  *  +27* 


*  27  * 


1  2 

+  li*  i1  * 


This  problem  Is  superconsistent ,  canonical,  and  has  six  degrees  of  difficulty. 
Both  constraints  are  active  at  the  unique  solution. 


Method.  Condense  the  last  three  terns  of  each  constraint.  Given  v, 
P(y)  is 


alnlalie  a 


l  *2 


1  *1  -  *ix 

subject  to  —  *  ♦  c.e  £  1 


5  -x2  -  *2X 

if9  ♦  c2*  i 1 


where 


*i  *  (er  c2*  *3^ 


(  I  :)• 
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'*-5 


and 


(•jt  e2,  e3)  - 


-1/2  yi+2y2  3  y2-yi  4  2yl+y2\ 

(u  *  »  IT  e  »  11  e  J  * 


,  y.+2y9  ,  y,-y.  ,  2yi+y9 

q  ^  —  a  x  ■  -  p  x  — »■  ■»  p 

5  11 e  11  11 


a2  *  ^1*  ®2*  ^3^ 


0  3)  • 


o  „ ,  yry2  J_  yr2y2  JL  2yry 2\ 

(B1*  B2*  B3)  "  T  ^27  e  *  27  »  27  6  J  » 

6  yry2  7  yr2y2  8  2yi“y2 
T  “  27  6  +27e  +27e 


Each  ?(y)  Is  superconsistent ,  canonical,  and  has  two  degrees  of  difficulty. 

We  chose  p  ■  400  and  Xq  ■  (4,6)  which  yields  ®p(xg^  “  8.2  x  *0B, 
hi(x0)  -  2.0  x  106,  and  h2(xQ)  *2.2.  The  following  table  shows  that  four 
iterations  produce  an  x^  feasible  for  the  original  problem  such  that  hg(x^) 
is  within  0.1  percent  of  the  optimal  objective  function  value.  Notice  that  the 
lower  bounds  hg(z^,x^)  proved  very  useful  in  determining  when  an  acceptable 
solution  is  at  hand. 
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1 

W 

vw 

VW 

hl^Xi+l^ 

h2(xi+l) 

VvV 

1 

535.13 

30.636 

0.21623 

1.0761 

0.49853 

0.017098 

1 

29.496 

7.7871 

0.089265 

1.0192 

0.85597 

0.065315 

9 

0.67405 

0.25104 

0.073065 

1.0004 

1.0000 

0.072943 

9 

0.078688 

0.073136 

0.073136 

1.0000 

0.99989 

0.073124 

In  conclusion,  we  have  shown  that  solving  condensed  geometric  programs 
In  our  globally  convergent  scheme,  combined  with  a  lower  bound  stopping 
criterion  using  the  knowledge  of  how  accurate  a  solution  it  is  worthwhile 
to  compute,  provide  a  powerful  new  means  of  solving  geometric  programs. 

Acknowledgment .  The  author  is  grateful  to  Douglass  Wilde  for  his  helpful 
suggestions  on  a  preliminary  draft  of  this  paper. 
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